SOME IDENTITIES AND FORMULAS INVOLVING 
GENERALIZED CATALAN NUMBERS ^ 

Siu-Ah Ng 



A generalization of the Catalan numbers is considered. New results in- 
clude binomial identities, recursive relations and a close formula for the 
multivariate generating function. A simple expression for the Catalan 
determinant is derived. 
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1. Introduction 

By a path, we mean a finite sequence {ai}o<i<ri where € and either a^+i = 

fli + (1, 0) or fli+i — ai + (0, 1). That is to say, a walk from oq to a„ whose steps 

consist of horizontal or vertical positive unit movements. 

The classical Catalan number is the number of paths from the origin to {n, n) 

1 f2n\ 

without crossing the diagonal. It is given by ) and has many well-known 

n + 1 \n J 

interpretations (Cf jl]). In [Jj we came across the following natural generalization: 

Definition 1. Let < m < n. The generalized Catalan number is defined as the 
number of paths from (0, —2m) to {n ~ m,n ~ m) without crossing the line y — x 
and is denoted by Cn,m- 

2m +1 / 2n \ 

For completeness, we give in ^ a proof of C„ ,„ ~ ( ) . The 

n + m + 1 \n + mj 

number Cn.m has an ancient history. In pP Bertrand studied a„^„i, the number 
of paths from the origin to (n, m) without crossing the diagonal, as the number 
of solutions to the ballot problem. One verifies that Cn,m — o,n+m.n-m, n > m. 
(For another derivation, one can check that Cn,m = i^(i-2m)(i+n-m) foi' the dqk 
defined in [S].) 

We then move on to proving new identities and formulas. 

In ^we consider some variations of Cn.m and compute closed formulas for them. 
In ^ we derive some binomial identities by counting Catalan numbers and their 
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variants. In fJSlwe obtain recursive relations for the generalized Catalan numbers, 
from which identities for the central binomial coefficients and an identity for cer- 
tain sum of products of the generalized Catalan numbers are given. In the main 
section fj^we give a close formula for the multivariate generating function for the 
generalized Catalan numbers. In SQmore formulas are produced, including certain 
product of the generating functions. We end in the final section ^Hlwith an elegant 
formula for the Catalan determinant. 



2. Generalized Catalan numbers 

For completeness, we give a direct derivation of a close formula for Cn,m although 
as remarked earlier it is already included in references such as ^ and [1]. 

Theorem 2. Given integers < m < n, we have 

2m + 1 / 2n 



(1) Cn,m 111 I 

n + m + 1 \n + my 

Proof. Fix < m < n. We let A be the set of paths from (0, —2m) to (n — m,n — m) 
and r be the subset of "bad" paths, i.e. those that cross the line y = x. So 

Cn,™ = |A| - |r|. 



Given a path P eT, let k be smallest such that P crosses y — x at (fc, k), i.e. both 
(fc, k) and (fc, fc + 1) belong to P. Let Q be the path obtained from P by reflecting 
across the line y = x + 1 only the portion of P from (0, —2m) to (fc, fc + 1). The 
result is a path from (—2m — 1, 1) to {n — m,n — m). See Figure ^ above. This 
is simply the Andre's reflection method and one can easily see that it establishes 
a bijection between F and the set of paths from (—2m — 1, 1) to {n — m,n — m). 
Therefore 

^2(n- m) - (-2m- 1 + 1)\ _ / 2n 



[n — m) — 1 J \n — m — 1 

On the other hand, 

_ /2{n - m) - {-2m)\ f 2n \ f 2n 
\ n — m J \n — mj \n + m 

Hence 

r — 



( ) 




L 


2n 




\n + mJ 






- m ~ 
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Bertrand's ballot problem: Suppose the final result of an election is such that 
party A gets 2m more votes than its opposition B, say A gets n + m and B gets 
n — m votes. What is the probability that A is at least 2m votes ahead of B in all 
partial counts throughout the whole process? 

Corollary 3. The probability of the event in the generalized ballot problem is 
2m + 1 

n + m+1 

Proof. By shifting all paths 2m units up, it is clear that Cn,m is the number of 
paths from (0, 0) to (n — m, n + m) without crossing y = x + 2m. Then identify 
each vote for party A with (0, 1) and that for party B with (1,0). 

The conclusion follows by noticing that each possible ballot outcome in the 
problem corresponds to a path from (0,0) to {n — m,n + m), hence there are 
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[ I of them in total, and those baUot outcome with A leading B by at 

\n + m J 

least 2to votes corresponds to paths from (0, 0) to {n — m, n + m) without crossing 
y — X + 2m. □ 



3. Some variations 

Superficially, the definition of Cn,m seems to be too restricted, since it considers 
only the even number 2m and allowing touching y = x. Here in this section we 
consider some variations and they can be computed in similar ways. 

Definition 4. Let < m < n. 

Cn,m ■— the number of paths from (0,— 2m — f) to {n — m,n — m) without crossing 
y^x. 

T^n.m '■— the number of paths from (0,— 2m) to {n — m,n — m) strictly below 
y — X until reaching {n — m,n ~ m). 

T^n.m '■= the number of paths from (0, — 2to — f ) to [n — m,n — m) strictly below 
y = X until reaching (n — m,n — m). 



Theorem 5. Suppose < m < n. Then 

_ 2m + 2 f2n+l 



(2) V„ 



n + m + 2 \n ~ m J ' 
2m /2n - V 



(3) T)n,m — 



n + m \n — m J ' 

2m + 1 ( 2n 



n + m + I \n — my 

Proof. We first prove (2). Notice that a path P from the set defining 'Dn,m must 
pass (n — TO, n — m — f ) before reaching (n — m,n — to). Similar to the proof of 
Theorem|21 P can be reflected across the line y — x to form a path Q from (— 2to, 0) 
to {n — m,n — m — 1), as shown in Figure|21 
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{n—m, n—m) 
'n—m, n—m—1) 



(0, -2m) 



Figure 2. Reflection across y = x. 



We let A be the set of paths from (0, — 2m) to {n — m,n — m—l) and F be 

the subset of "bad" paths, i.e. those that cross the line y — x — 1. Clearly, 

|A| = I ). By the reflection trick used in the proof of Theorem |2 we can 

\n — m J 

identify paths in F with paths from (— 27ti,0) to (n — m,n — m — 1). Therefore 

\ ;t -f III I 

2n-l\ /2n-l\ 2m /2n-l 



\n + m J 
So = |A| - |F| 



n — m I \n + m J n + m \n — m 



To prove (1), by shifting 1 unit down, we can identify paths defining Cn.m with 
paths from (0, — 2m — 2) to {n ~ m,n — m) strictly below y ~ x until reaching 
(n ~ m,n — m). Therefore 

and the conclusion follows. 

Now for (3), we notice that by shifting 1 unit up, paths defining are 

identified with paths from (0, —2m) to {n — m,n — m) without crossing y = x. 

Therefore V^m = Cn.m- □ 
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4. Identities 

In this section we derive some interesting identities which are corollaries of 
Theorem |5] or Theorem [S] 



Corollary 6. 

y.^ 1 (2k + 1\ /2(n -k)\ _ 2 / 2n \ 
^ ^ ^ 2k + l \k-mj [ n- k J 2m + lUi - m - Ij ' 

k—m 

where < m < n — 1. 

Proof. A "bad" path P from (0, — 2m) to {n — m,n — m), i.e. one that crosses 
y — X, is a path from (0, —2m) to some (fc, k), immediately before crossing y — x, 
then proceed to {k,k+l) and takes any path to (n — m,n — m) as shown in Figure|3| 
below. 




Figure 3. Decomposition of a "bad" path P. 
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The number of paths from (0, — 2m) to {k,k) = {{m + k) — m, {m + k) — m) 



without crossing y — x is given by Cm+k.m, while the number of paths from (fc, 
to {n — TO, 
paths are 



to {n — m,n — m) is given hy [ \ 1 . Hence the number of the "bad" 

n — m ^ k 



" ^ ^2(n — TO — fc) — 1 



n-1 



n — m ' 



k 



k=Q 
n-1 

E 



2(n- fc) - 1 
n — k 



2to+ 1 ( 2k \ (2{n-k) - 1 



fc + TO+lVfc~"^/V n — k 

k—Q 

f 2n \ 

Since this number is also ] ~ Cn m the conclusion follows from the last 

\n — m J 

part of the proof of Theorem |3 □ 



Corollary 7. 

1 ( "^^ \ P*-" ^ _ 2to + 1 / 2n 

^-^ fc(n — fc + 1) \k~mj \ n~k J TO(n + TO+l) \n — m 

k=m / \ / \ / / \ . 

where 1 < to < n — 1. 

Proof. Note that a path from (0, —2m) to [n — m,n — m) without crossing the line 
y = X can be decomposed into a path from (0, —2m) to the first touch at ?/ = x, 
some {k,k), < k < n — to, followed by a path from {k,k), to (n — m,n — m) 
without crossing the line j/ = x, as shown in Figure 01 below. 
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The number of the former paths is I?fe+„i,m Sind the number for the latter is 
Cn-m-k.Qi therefore we have 

n—m 

(4) Cn,m ^ ^ ^k+m,7n ^n — 7n — k,0- 

k=0 

Hence, by Theorem |21 and Theorem [SJ 
2m +1 f 2n \ _ 2m /2(fc + m)-l\ 1 /2(n - m - fc) 



n + m + 1 — mj k + 2m \ k Jn — m — k + l\n — m — k 



2m (2k -\\ 1 (2{n-k) 



k + m \k — m J n — k + 1 \ n — k 

k—m 



E 

k—m 

71 

E 



2k \ (2{n~ k) 



k{n^k + l) \k — mj \ n — k 



and equation © foUows. 



□ 
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Remark 8. It is easy to check that equation 0) can be transformed into the fol- 
lowing form: 

2n-2k + l f 2k \/2(n-fc)^ 



(5) E 



(k -\- m -\- l)(n — k 1) \k — mj \ n—k 

k—m 



(2n + l)(2n + 2) / 2n 



(2m + l){n + m + l){n + m + 2) \n — m 
Although equations ^ and 0) are similar, it is not clear how they are related 



Corollary 9. 



(6) 



E 

/i=0 



'2h + k- 1\ f2n -2h-k 
h + k \ h J \ n — m h 



2n 

n ~ m 



where < to < n — 1 and 1 < k < 2r 



2n 



Proof. The number of paths from (0, — 2to) to {n — m,n — to) is , the 

\n ~ mj 

right side of the above equation 10. 

For each 1 < A; < 2to we can decompose these paths into the path from (0, —2m) 
first touching the fine y — x — 2m + fc at (h^h — 2m + fc), for some < h < n ~ m, 
foUowed by a path from (ft,, h — 2m + fc) to [n — m,n ~ to), as shown in Figure [3 



below. 




{n—m, n—m) 



Figure 5. The decomposition at (/i, h — 2m + fc). 
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The number of paths from {h, h — 2m + fc) to (n — m, n — m) is ( 

\ n ~ m ~ h 

By shifting up 2m — k units, we can see that a path from (0, —2™) first touching 
the hue y = x — 2m + k at {h,h— 2m + k) corresponds to a path from (0, —k) first 
touching the hue y — x at {h, h). Therefore, if k is even, the number is given by 

_ k /2/i + fc-l' 

- /Tfl ^ h 
if k is odd, the number is given by 

k (2h + k-l 

both producing the same number. 

k f2h + k-l\ f2n-2h- k\ „ , 

humming the product over < n < n — m, 

h + k \ h J\n — m — hj 

equation © follows. □ 



Remark 10. It appears that equation ^ actually holds for all k > 0. The current 
proof doesn't seem to work for k > 2m. 



5. Recursive relations and identities 

In this section we will derive two recursive relations for the generalized Catalan 
numbers. As corollaries, we obtain some identities for the central binomial coeffi- 
cients. The first recursive relation for the generalized Catalan numbers we consider 
is the following. 



Theorem 11. For < m < n, we have the following recursive relation 

n(2m + 1) 
(n — m){m + 1) 



(7\ r ~ "(^"^ + ^) \^ r r 

Proof. A path defining Cn,m is either a path touching the line y = x only once at 
the terminal point {n~m,n^m) (the number of such is given by Vn^m ) or a path 
touching some (fc, k), where 0<fc<n — m — 1 for the last time before hitting the 
terminal point {n — m,n — m), as shown in Figure El 
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Figure 6. A path P touching y = x &i {k,k) for the last time 
before touching the terminal point (n — m, n — m). 

Write {k, k) as ((fc + m) — m, (fc + m) — m), we have 

n — m — 1 



But 



fe=0 



2m /2n - 1 



n + m \n — m 



m{n + m + 1) 
n(2m+ 1) 



hence the result follows. 



□ 



As a consequence, we have an identity for the nth central binomial coefficient. 
Corollary 12. Let < m < n, then 
(8) 



2n\ _ n + m+1 (TT"- + ^\ ^ 

~ ^ Z I I I r I / , ^k+m.m^n — m — k — 1,0- 

m+1 \ n — k- I ^ — ' 



n — k , 

\k=0 / fe=0 



Proof. By equation ((TJ in Theorem II II we have: 
2m +1 (2n)! n(2m + 1) 



n + m+1 (n + m)! (n — m)! (n — m){m + 1) 



^ ^ ^/^+m, m ^n— m — fc — 1, 0? 



fe=0 



I.e. 



(2n) 



n+m+1 n 



(n + m)!(n — m)! m+1 n — m 



^ ^ ^fc+rn, rn ^n— m— fc— 1, 0; 



fc=0 
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hence the resuh foUows if m = 0; for to > 0, we get 

(2n)! n + TO+1 n{n + 1) ■ ■ ■ {n + m) " ^ ^ ^ 

n'. TO + 1 n[n — 1) ■ ■ ■ (n — m + l)[n — m) ^— ^ 

and equation ^ is proved. □ 

The following is an identity to be used in ^ 
Corollary 13. Let 1 < m < n, then 

n—m — l ^ / rn \ f ^ \ 

(9) E 

m, rn^n—rn — k—l. — 
fe=0 ■ \k=l ) \k=m\2 ) 



n 



71 + fc 
fc 



Proof. This is just another form of equation ((SJ in Corollary II 21 bv noticing that 

2n\ -j^ n + k 



n J k 



□ 



Now we consider another recursive relation for the generalized Catalan numbers. 



Theorem 14. Let < m < n. Th 



en 



2m + 1 ^-^ k 

k—ni 

Proof. From Theorem 5 in we can obtain 

_ m{n + TO + 1) 
n[2m + 1) 

The result then follows from the identity (4) of Corollary 7 in [Jj. □ 

From the above we have another expression for the nth central binomial coeffi- 
cient. The proof is similar to that of Corollarvll2l 

Corollary 15. Let 1 < m < n, then 

(2n\ m{n~m) ("^ n + k \ ^ k + m + l 
^^^> [n) - (2to+1)2 1 U n-k + 1 ^ k ^'^■"^^-'^■O. 

□ 
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6. Generating functions 

In this section we derive a formula for the generating function of generaUzed Catalan 
numbers. A comprehensive teatment of generating functions can be found in |[3|. 

For fixed m > we let jm{x) denote the generating function of Cn+m,m, i-e. 
the power series 

oo 
n=0 

It is well-known for the classical Catalan numbers C„ q that 



= 2-x • 

(See for example 0.) 

We denote the corresponding multivariate generating function by: 

oo 

(13) r{x,y) = C„+™,™x"y". 

n,m— 

In a formula for the generating function for an,m is given, i.e. 

, ^ n m 1 - V lojxy) 

a{x,y) := } a^^mX y = — . 

^-^ 1 — X — y 

n>m>0 

Using the relation Cn,m = o,n+m.n-m, it can be verified that 

nx,y) = l{a{Vy,^) +a{-Vy,-^) 

However we have another derivation here: 



Theorem 16. 



(14) r{x,y)- x(l-Vr^) 



1 - 2/(7o(a;))2 ■• 2x(a; + 2/)-2/(l- Vl-4a;) 



We first need the following lemmas: 
Lemma 17. Let n >0 then 

(15) Cn-\-m+l,m + l = ^ ] Ca+m, m Cb, ^c, 

where we sum over a, b, c > 0. 



a+o+c — n 



□ 
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Proof. Wc count paths defining C„+„i+i,m+i, i-e. paths from (0, — 2(?7i + 1)) to 
(n,n) — (n + TO + 1 — (to + 1), n + m + 1 — (m + 1)) without crossing = a;, by 
using their first touch (a, a — 1) and (a + 6, a + 6) at the lines y = x — 1 and y = x 
respectively as in Figure H below. 




Figure 7. A path defining Cn+m+i,m+i whose first touch a.t y = 
X — 1 is (a, a — 1) and whose first touch at y — x is (a + b, a + b) 
for some a,b > and a + b < n. 



The number of paths from (0, —2m — 2) that first touch y = x — I at (a, a — 1) 
is the same as those reaching (a, a — 2) without crossing y = x — 2, i.e. the number 
is Ca+m.rn- Similarly, the number of paths from (a, a — 1) first reaching y — x at 
{a + b, a + b) is C^fl and the number of paths from (a + 6, a + 6) to (n, n) without 
crossing y — x is Cc,o, where c — n — a — b, hence the result is proved. □ 

Now we have a recursive relation for the generating function 7^ (x) : 
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Lemma 18. For m > 0, we have 

(16) %n+i{x) = 7m(x) (7o(a;))^. 

Proof. We use Lemma [T7I 

oo 

7m+l(2^) = Cn+rn+l,m+l 
oo 



n— a+6+c— n 
/ oo \ / oo 



= 7m(a;) (7o(a;))^ 



\n=0 / \n=0 

\2 



Now we prove Theorem 1 1 61 from Lemma II8I Note that: 

00 



?TI = 



= r(a;,y) - 70(3;), 

from which the result foUows. 



□ 



7. More formulas 
In this section we prove a few more fomulas relating to the generating functions. 

Theorem 19. Let m > 1. The generating function 'jmix) is equal to 

2x / , I -Py n + m - k + l\ n + + k + i\ „\ 
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Proof. Replacing n — m—1 by n in CoroUarvll^leaiiation 0, we obtain for n > 1 
that 

El n + m — k + l\ I'^+p^^ n + m + k + l\ 
Ck+m.mCn^k.O - \ll ^ [[ ^ • 

k=0 \k=l / \fc=m+2 / 

Then the theorem foUows from 

oo 

7m (a;) 70 (x) = X! X! Ck+m,TnCh,ox"' 
n—Q k+h—n 
oo / n \ 

= I Ck+m, m Cn-k, j 2:" 

n=0 \fc=0 / 

n + m — k + l\ / T-r n + m + /e + l\ 



We now the product 7m (a;) 70 (a;) and the foUowing power series: 

N (n + l)(m + l) 

v'^'' ^ / J ( I l1^l'o l1^ '-'"+"1+1. m a; . 
^-^ In + m + l)(2m + 1) 

n— ^ 



Theorem 20. Let < m < n. Then 



□ 



1 _ _ 

6'(a:) = 7>n(a;)7o(a;) = 7m (a;) ■ 



2a; 



Proof. From Theorem II II equation Q we have 

+ m + l)(2m 4 
(n + l)(m + l) 



_ (n + m+l)(2m+l) ^ 

I T) -k I W m 4- 1 I ■'^^ — ^ 



k-\'h—n 

therefore 

1 - VI - 4x 



n—O \k-\-h— 71 



2x 



□ 



Corollary 21. For 1 < to < n, we have 

, , _ (n + TO)(2TO+l) (^n + m — k\ f tt" n + TO + fc\ 
(17) C„+„,m- M| ^ II II I. 

^ ' \k=l J \fc=m+2 / 

Proof. Compare the power series expansion of 7m (x) 70 (a;) given in the last line of 
the proof of Theoremll9land the one given by d(x). Then equation (|17|l is obtained 
by replacing n + 1 for n. □ 
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8. A BEAUTIFUL DETERMINANT 

In this final section, we prove the following: 
Theorem 22. For integer N > 0, 



det 



{2i + l) {2{i + j))l 



i!(2i + j + l)! 



0<i,j<N 



□ 



We suggest the name Catalan determinant for the above, for the entries of the 
matrix are the generalized Catalan numbers C,;+j, i . 

First we need a slight modification of 8 Theorem 1. (See also [S]-) An almost 
identical proof is given here for completeness. 

Lemma 23. Let f{x) = 1 + aix + a2X^ + . . . be a formal power series and let 



^i,3 ~ [2^"']/^'^"'^ (2;), where [x^] denotes the coefficient of in the series. Then 



det [ci j ] 



3iO<iJ<N 



(2ai 



Proof. Let 



^ — lc^,j]o<^,J<N and B := [6,,i]o<,j<jv wliere 



(-1) 



i+j 



The (j, A:)-entry of BC is given by 

N N 

"^j^fe ^ 

j=0 



which equals (2ai 



J=0 



3=0 



-1)' ((1 - f{x)y fix)) 

-ly [x^] (^{-2aix + ...y {1 + aix + ...)) , 



if k — i and equals if k < i, hence BC is an upper 



triangular matrix with diagonal entries (2ai)*. In particular 



N 



det BC = Yli2aiy = (2ai)' 



But B is lower triangular with diagonal entries 1, so dctB = 1 and the result 
follows. □ 
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To prove Theorem|221 we let, as in ^ Ji{x) be the generating function of Ci+j^ i, 
i.e. the power series 

oc 
n=0 

Then 

1 — \/l — 4a; 9 T /I 

7o(a;) = ^ = l + x + 2x^ + 5x^ + Ux^ + ... 

2x 

and Lemma ITRl gives that 

7^(2;) = lo^\x) for i > 0. 

Therefore 

- [x^Mx) = [x^H'^\x) 
and Theorem 122 is now proved by applying Lemma l23l to f{x) — ja{x). 
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